منابع مشابه
Scaling Smoothed Language Models
In Continuous Speech Recognition (CSR) systems a Language Model (LM) is required to represent the syntactic constraints of the language. Then a smoothing technique needs to be applied to avoid null LM probabilities. Each smoothing technique leads to a different LM probability distribution. Test set perplexity is usually used to evaluate smoothing techniques but the relationship with acoustic mo...
متن کاملUsing smoothed K-TSS language models in continuous speech recognition
A syntactic approach of the well-known N-grams models, the K-Testable Language in the Strict Sense (K-TSS), is used in this work to be integrated in a Continuous Speech Recognition (CSR) system. The use of smoothed K-TSS regular grammars allowed to obtain a deterministic Stochastic Finite State Automaton (SFSA) integrating K k-TSS models into a selfcontained model. An efficient representation o...
متن کاملEmoticon Smoothed Language Models for Twitter Sentiment Analysis
Twitter sentiment analysis (TSA) has become a hot research topic in recent years. The goal of this task is to discover the attitude or opinion of the tweets, which is typically formulated as a machine learning based text classification problem. Some methods use manually labeled data to train fully supervised models, while others use some noisy labels, such as emoticons and hashtags, for model t...
متن کامل600.465 — Natural Language Processing Assignment 3: Smoothed Language Modeling
Probabilistic models are an indispensable part of modern NLP. This assignment will try to convince you that even simplistic and linguistically stupid models like n-gram models can be very useful, provided their parameters are estimated carefully. See reading section A. You now know enough about probability to build and use some trigram language models. You will experiment with different types o...
متن کامل601.465/665 — Natural Language Processing Assignment 3: Smoothed Language Modeling
Probabilistic models are an indispensable part of modern NLP. This assignment will try to convince you that even simplistic and linguistically stupid models like n-gram models can be very useful, provided their parameters are estimated carefully. See reading section A. You now know enough about probability to build and use some trigram language models. You will experiment with different types o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Speech Technology
سال: 2005
ISSN: 1381-2416,1572-8110
DOI: 10.1007/s10772-006-9047-5